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SEQUENCE LISTING 
(1) GENERAL INFORMATION: 



APPLICANT: 


(A) 


NAME: Hoechst Aktiengesellschaft 


(B) 


STREET: 


(C) 


CITY: Frankfurt 


(D) 


FEDERAL STATE: - 


(E) 


COUNTRY: Germany 


(F) 


POSTAL CODE: 65926 


(G) 


TELEPHONE: 069-305-3005 


(H) 


TELEFAX: 069-35-7175 


(I) 


TELEX; - 



(ii) TITLE OF APPLICATION: Isolation of the genes for 

biosynthesizing pseudo-oligosaccharides from 
Streptomyces glaucescens GLA.O and their use 

(iii) NUMBER OF SEQUENCES: 13 

(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version 
#1.25 (EPO) 

(2) INFORMATION FOR SEQ ID NO.: 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



10 



25 
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(ix) FEATURES: 

(A) NAME/KEY: exon 

(B) LOCATION: 1..22 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO.: 1 : 

CSGGSGSSGC SGGSTTCATS GG 22 

(2) INFORMATION FOR SEQ ID NO.: 2: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
15 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(ix) FEATURES: 
20 (A) NAME/KEY: exon 

(B) LOCATION: 1..24 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO.: 2: 



GGGWVCTGGY VSGGSCCGTA GTTG 24 



(2) INFORMATION FOR SEQ ID NO.: 3: 

(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 546 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

35 (ii) MOLECULE TYPE: DNA (genomic) 



(ix) FEATURES: 

(A) NAME/KEY: exon 
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(B) LOCATION: 1..546 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO.: 3: 

(xi) SEQUENZBESCHREIBUNG : SEQ ID NO: 3: 

CCCGGGCGGG GCGGOGTTCA TCGGCTCCGC CTACGTCCGC CGGCTCCTCT CGCCCGGQGC 6 0 

CCCCGGCGGC GTCGCGGTGA CCGTCCTCGA CAAACTCACC TACGCCGGCA GCCTCGCCCG 120 

CCTGCACGCG GTGCGTGACC ATCCCGGCCT CACCTTCGTC CAGGGCGACG TGTGCGACAC 18 0 

CGCGCTCGTC GACACGCTGG CCGCGCGGCA CGACGACATC GTGCACTTCG CGGCCGAGTC 24 0 

GCACGTCGAC CGCTCCATCA CCGACAGCGG TGCCTTCACC CGCACCAACG TGCTGGGCAC 300 

CCAGGTCCTG CTCGACGCCG CGCTCCGCCA CGGTGTGCGC ACCCTCGTGC ACGTCTCCAC 360 

CGACGAGGTG IACi3GCICCC TCCJCUCACGCi GCJCCGCCGCG GAGAGCGACC CCCTGCTCCC 420 

GACCTCGCCG TACGCGGCGT CGAAGGCGGC CTCGGACCTC ATGGCGCTCG CCCACCACCG 480 

CACCCACGGC CTGGACGTCC GGGTGACCCG CTGTTCGAAC AACTACGGCC CGCACCAGTT 54 0 

CCCGGG 546 

(2) INFORMATION FOR SEQ ID NO.: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 541 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURES: 

(A) NAME/KEY: exon 

(B) LOCATION: 1..541 

(XI) SEQUENZBESCHREIBUNG: SEQ ID NO: 4: 

CCCCCGGTGC TGGTAGGGGC CGTAGTTCTT GCAGCAGCGG GTGATGCGCA CGTCCAGGCC 6 0 

GTGGCTGACC TGCATGGCCA GCGCGAGCAG GTCGCCCGAC GCCTTGGAGG TGGCATAGGG 12 0 

GCTGTTGGGG CGCAGCGGCT CGTCCTCCGT CCACGACCCC GTCTCCAGCG AGCCGTAGAC 180 

CTCGTCGGTG GACACCTGCA CGAAGGGGGC CACGCCGTGC CGCAGGGCCG CGTCGAGGAG 24 0 

TGTCTGCGTG CCGCCGGCGT T3GTCCGCAC GAACGCGGCG GCATCGAGCA GCGAGCGGTC 3 00 
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CACGTGCGAC TCGGCGGCGA GGTGCACGAC CTGGTCCTGG CCGGCCATGA CCCGGTCGAC 36 0 

CAGGTCCGCG TCGCAGATGT CGCCGTGGAC GAAGCGCAGC CGGGGGTGGT CGCGGACCGG 420 

GTCGAGGTTG GCGAGGTTGC CGGCGTAGCT CAGGGCGTCG AGCACGGTGA CGACGGCGTC 480 

GGGCGCCCCG TCCGGACCGA GGAOGGTGCG GACGTAGTGC OAGCCCATGA ACCCCGCCGC 54 0 

C 541 



(2) 



INFORMATION FOR SEQ ID NO.: 5: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 180 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



10 



(ii) 



MOLECULE TYPE: protein 



15 



(ix) FEATURES: 

(A) NAME/KEY: PCRstrE.Pep 

(B) LOCATION: 1..180 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO.: 5: 
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Ala Ala Gly Phe Met Gly Ser Hie Tyr Val Arg Thr Leu Leu Gly Pro 
15 io 15 

Asp Gly Pro Pro Asp Ala Val Val Thr Val Leu Asp Ala Leu Ser Tyr 
20 25 30 

Ala Gly Asn Leu Ala Asn Leu Asp Pro Val Arg Asp His Pro Arg Leu 
35 40 45 

Arg Phe Val His Gly Asp lie Cys Asp Ala Asp Leu Val Asp Arg Val 

50 55 60 

Met Ala Gly Gin Asp Gin Val Val His Leu Ala Ala Glu Ser His Val 
65 70 75 80 

Asp Arg Ser Leu Leu Asp Ala Ala Ala Phe Val Arg Thr Asn Ala Gly 
65 90 95 

Gly Tli j. Glii 7 hi. Leu Leu A»p Ala Ala Leu Arg His Gly Val Ala Pro 
100 105 110 

Phe Val Gin Val Ser Thr Asp Glu Val Tyr Gly Ser Leu Glu Thr Gly 
115 120 125 

Ser Trp Thr Glu Asp Glu Pro Leu Arg Pro Asn Ser Pro Tyr Ala Thr 
130 135 140 

Ser Lys Ala Ser Gly Asp Leu Leu Ala Leu Ala Met His Val Ser His 
145 150 155 160 

Gly Leu Asp val Arg lie Thr Arg Cys Ser Asn Asn Tyr Gly Pro Tyr 
165 170 175 

Gin His Pro Gly 
180 

(2) INFORMATION FOR SEQ ID NO.: 6: 

(t) SEQUENCE CHARACTERISTICS: 
5 (A) LENGTH: 181 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

1 0 (ii) MOLECULE TYPE: protein 

(ix) FEATURES: 

(A) NAME/KEY: PCR acbD.Pep 

(B) LOCATION: 1.181 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO.: 6: 
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Pro Gly Gly Ala Gly Phe lie Gly Ser Ala Tyr Val Arg Arg Leu Leu 
15 10 15 

Ser Pro Gly Ala Pro Gly Gly Val Ala Val Thr Val Leu Asp Lys Leu 

20 25 30 

Thr Tyx Ala Gly Ser Leu Ala Arg Leu Hie Ala Val Arg Asp His Pro 
35 40 45 

Gly Leu Thr Phe Val Gin Gly Asp Val Cys Asp Thr Ala Leu val Asp 

50 55 60 

Thr Leu Ala Ala Arg His Asp Asp lie Val His Phe Ala Ala Glu Ser 
65 70 75 80 

His Val Asp Arg Ser lie Thr Asp Ser Gly Ala Phe Thr Arg Thr Asn 
85 90 95 

Val Leu Gly Thr Gin Val Leu Leu Asp Aia Aia Leu Arg His uly Vai 
100 105 110 

Arg Thr Leu Val His Val Ser Thr Asp Glu Val Tyr Gly Ser Leu Pro 
115 120 125 

His Gly Ala Ala Ala Glu Ser Asp Pro Leu Leu Pro Thr Ser Pro Tyr 
130 135 140 

Ala Ala Ser Lys Ala Ala Ser Asp Leu Met Ala Leu Ala His His Arg 
145 150 155 160 

Thr His Gly Leu Asp Val Arg Val Thr Arg Cys Ser Asn Asn Tyx Gly 
165 170 175 

Pro His Gin Phe Pro 



180 



(2) 



INFORMATION FOR SEQ ID NO.: 7: 



(0 



SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6854 base pairs 

(B) TYPE: nucleic acid 



5 



(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



10 



MOLECULE TYPE: DNA (genomic) 



(ix) 



FEATURES: 



(A) NAME/KEY: "acarbose" biosynthesis gene cluster 

(B) LOCATION: 1..6854 



lb 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO.: 7: 
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CTGCAGGGTT CCCTGGTGCA CCACCCOCCC CTGGTCGACG ACCAGGGCGC TGTCGCAGAT 60 

CGCGGCGATG TCGGCGATGT CGTGGCTGGT GAGCACCACG GTCCT G CCCA GTTCCCGGTG 120 

GGCGCGGTTG ACCAGCCGGC GCACCGCGTC CTTCAGCACC ATGTCGAGGC CGATCGTGGG 180 

CTCGTCCCAG AACACCACGG CCGGGTCGTG CAGCAGGCTC GCCGCGATCT CGGCGCGCAT 240 

GCGCTGTCCG AGGCTGAGCT GCCGCACGGG GGTGGACCCC AGCGCGTCGA TGTCGAGGAG 300 

GTCCCGGAAC AGGGCGAGGT TGCGCCGGTA GACCGGTCCG GGGATX7TCGT AGATGCGGCG 36 0 

CAGGATGCGG AAGGACTCGG GTACCGACAG GTCCCACCAG AGCTGGCTGC GCTGGCCGAA 42 0 

GACGACGCCG ATCGTGCGGG CGTTGCGCTG CCGGTGCCGG TAGGGCTCCA GCCCGGCGAC 4 80 

CGTGCAGCGG CCGGAGGTGG GGGTCATGAT GCCGGTCAGC ATCTTGATCG TGGTCGACTT 54 0 

GCCCGCTCCG TTGGCGCCGA TGTAGGCGGT CTTCGTGCCG GCCGGTATCT CGAAGGAGAC 600 

GTCGTCGACG GCGCGCACGA CTGCGGTACCG GCGGGTCAGG AGGGTGGAGA GGCTGCCGAG SSC 

CJ^GGCCGGGC TCGCGTTCGG CCAGCCGGAA CTCCTTGACG AGGTGTTCGG CCACGATCAC 720 

GCGATCACCC GCTCGACGGC CGTCTCCAGC AGGCGCAGGC CCTCGTCGAG CAGCGCCTCG 780 

TCGAGGGTGA ACGGCGGTGC CAGCCGCAGG ATGTGGCCGC CCAGGGAGGT GCGCAGCCCC 64 0 

AGGTCGAGGG CGGTGGTGTA GACGGCCCGG GCGGTCTCGG GGGCGGGTGC CCGGCCGACG 900 

GCGTCGGTGA CGAACTCCAG GCCCCACAGC AGTCCGAGGC CGCGTACCTG GCCGAGCTGG 96 0 

GGGAAGCGGG ACTCCAGGGC GCGCAGCCGC TCCTGGATGA GCTCGCCGAG GACGCGCACG 1020 

CGCTCGATCA GCCGGTCGCG CTCGACGACC TCCAGCGTGG CGCGGGCGGC GGCGATCCCC 108 0 

AGTGGGTTGC TCGCGTACGT CGAGGCGTAC GCCCCGGGGT GGCCGCCTCC GGCCTGCGCA 114 0 

GCTTCCGCGC GTCCGGCCAG CACGGCGAAG GGGAATCCGC TCGCGGTGCC CTTGGACAGC 1200 

ATCGCCAGGT CCGGCTCGAT GCCGAACAGT TCGCTGG CGA GGAAGGCGCC GGTGCGCCCG 1260 

CCGCCGGTGA GGACCTCGTC GGCGACGAGC AGCACGCCGC CGTCCCGGCA GGCGCCGGCG 132 0 

ATCCGCTCCC AGTAGCCGGG GGGCGGCACG ATGACGCCTG CCGCGCCGAG GACGGGTTCG 13 8 0 

AAGACCAGGG CCGAGACGTT GGGCTTCTCC GCGATGTGCC GGCGCACGAG GGTCGCGCAC 14 4 0 

CGCACGTCGC ACGAGGGGTA CTCCAGGCCC AGGGGACAGC GGTAGCCAGT AGGGGCTGTA 1500 

GCCAGCACGC TGTTGCCGCT GAAGGCCTGG TGGCCGATGT CCCAG±GGAC CAGCATCCGG 156 0 

GCGCCCATGG TCTTGCCGTG GAAGCCGTGG CGCAGGGCGC AGATCCGGTT GCGGCCCGGC 16 2 0 

GCGGCGGTCG CCTGGACGAC CCGCAGGGCG GCCTCGACCA CCTCCGCGCC GGTGGAGAAG 16 8 0 

AAGGCGTAGG TGTCGAGCTG TTCGGGCAGC AGCCTGGCGA GCAGTTCCAG CAGGCCGGCG 174 0 

CGCTCCGGCG TGGCGCTGTC GTGOACGTTC CACAGGCGGC GGGCCTGGGT GGTGAGTGCC 18 00 
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TCGACGACCT CCGGGTGCCC GTGGCCCAGT GACTGGGTGA GCGTCCCGCC CGCGAAGTCG 1860 
AGGTAGTGGT TOCCGTCCAG GTCGGTCAGA ACGGGACCGC GTCCCTCGGC GAAGACCCGG 1920 
CGTCCGTGGA CGGCTTCCTC GGAGGCGCCC GGCGCCAGGT GGCG GG CCTC CCGTGCCAGG I960 

TGCTGTGTCT GCCGTAAGCC TGTCATCGCT GCCTCTGCTC GTCGGACCGG CTGACGCGAT 2 04 0 

CGCCGGCGAA CTGCGTTGTG GCCCACCACG GTTGGGGCGG CTCGGCGCTG AGTCAAACAC 2100 

TTGAACACAC ACCGCTGCAA GAGTTTGCGG GTTGTTTCAG AAAGTTGTTG CGAGCGGCCC 2160 

CGGCACTCTG GTTGAGTCGA CGTGCTTACG GCGCCACCAC GCCTCACGTT CGAGGAGGGA 2 220 

CCTGTGAGAA CAAGCCCGCA GACCGACCCG CTCCCGCGGA GGCCGAGGTG AAGGCCCTGG 228 0 

TCCTGGCAGG TGGAACCGGC AGCAGACTGA GGCCGTTCAC CCACACCGCC GC CAAGCAGC 234 0 

TGCTCCCCAT CGCCAACAAG CCCGTGCTCT TCTACGCGCT GGAGTCCCTC GCCGCGGCGG 24 00 

GTGTCCGGGA GGCCGGCGTC GTCGTGGGCG CGTACGGCCG GGAGATCCGC GAACTCACCG 24 6 0 

GCGACGGCAC CGCGTTCGGG TTACG CATCA CCTACCTCCA CCAGCCCCGC CCGCTCGGTC 2520 

TCGCGCACGC GGTGCGCATC GCCCGCGGCT TCCTGGGCGA CGACGACTTC CTG CTGTACC 2 5 80 

TGGGGGACAA CTACCTGCCC CAGGGCGTCA CCGACTTCGC CCGCCAATCG GCCGCCGATC 2640 

CCGCGGCGGC CCGGCTGCTG CTCACCCCGG TCGCGGACCC GTCCGCCTTC GGCGTCGCGG 2700 

AGGTCGACGC GGACGGGAAC GTGCTGCGCT TGGAGGAGAA ACCCGACGTC CCGCGCAGCT 2760 

CGCTCGCGCT CATCGGCGTG TACGCCTTCA GCCCGGCCGT CCACGAGGCG GTACGGGCCA 2820 

TCACCCCCTC CGCCCGCGGC GAGCTGGAGA TCACCCACGC CGTGCAGTGG ATGATCGACC 2 88 0 

GGGGCCTGCG CGTACGGGCC GAGACCACCA CCCGGCCCTG GCGCGACACC GGCAGCGCGG 294 0 

AGGACATGCT GGAGGTCAAC CGTCACGTCC TGGACGGACT GGAGGGCCGC ATCGAGGGGA 30 00 

AGGTCGACGC GCACAGCACG CTGGTCGGCC GGGTCCGGGT GGCCGAAGGC GCGATCGTGC 3 06 0 

GCGGCTCACA CGTGCTGGGC CCGGTGGTGA TCGGCGCGGG TGCCGTCGTC AGCAACTCCA 3120 

GTGTCGGCCC GTACACCTCC ATCGGGGAGG ACTCCCGGGT CGAGCACAGC CCCATCGAGT 3180 

ACTCCGTCCT GCTGCGCGGC GCCCAGGTCG AGGGGGCGTC CCGCATCGAG GCGTCCCTCA 324 0 

TCGGCCGCGG CGCCGTCGTC GGCCCGGCCC CCCGTCTCCC GCAGGCTCAC CGACTGGTGA 3 300 

TCGGC3ACCA CAGCAAGGTG TATCTCACCC CATGACCACG ACCATCCTCG TCACCGGCGG 3 36 0 

AGCGGGCTTC ATTCGCTCCG CCTACGTCCG CCGGCTCCTG TCGCCCGGGG CCCCCGGCGG 3420 

CGTCGCGGTG ACCGTCCTCG ACAAACTCAC CTACGCCGGC AGCCTCGCCC GCCTGCACGC 34 80 

GGTGCGTGAC CATCCCGGCC TCACCTTCGT CCAGGGCGAC GTGTGCGACA CCGCGCTCGT 3 54 0 

CGACACGCTG GCCGCGCGGC ACGACGACAT CCTGCACTTC GCGGCCGAGT CGCACGTCGA 36 00 

CCGCTCCATC ACCGACAGCG GTGCCTTCAC CCGCACCAAC GTGCTGGGCA CCCAGGTCCT 3660 

GCTCGACGCC GCGCTCCGCC ACGGTGTGCG CACCTTCGTG CACGTCTCCA CCGACGAGGT 3720 

GTACGGCTCC CTCCCGCACG GGGCCGCCGC GGAGAGCGAC CCCCTGCTTC CGACCTCGCC 3 780 

GTACGCGGCG TCGAAGGCGC CCTCGGACCT CATGG CGCTC GCCCACCACC GCACCCACGG 3B4 0 
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CCTGGACCTC CGGCTGACCC CCTCTTCGAA CAACTTCGGC CCCCACCAGC ATCCCGAGAA 3 900 
GCTCATACCG CGCTTCCTGA CCAGCCTCCT GTCCGGCGGC ACCGTTCCCC TCTACGGCGA 3 96 0 

CGGGCQGCAC GTGCGCGACT GGCTGCACGT CGACGACCAC GTCAGGGCCG TCGAACTCGT 4 02 0 

CCGCGTGTCG GGCCGGCCGG GAGA GAT CT A CAACATCGGG GGCGGCACCT CGCTGCCCAA 4 080 

CCTGGAGCTC ACGCACCGGT TGCTCGCACT GTGCGGCGCG GGCCCGGAGC GCATCGTCCA 414 0 

CGTCGAGAAC CGCAAGGGGC ACGJICCGGCG CTACGCGGTC GACCACAGCA AGATCACCGC 4200 

GGAACTCGGT TACCGGCCGC GCACCGACTT CGCGACCGCG CTGGCCGACA CCGCGAAGTG 4260 

GTACGAGCGG CACGAGGACT GGTGGCGTCC CCTGCTCGCC GCGACATGAC GTCGGGCCGG 4 320 

ACCGCAACCA CCGGCCCCGG CCGGCACACC GCCGCCCGCG GCCGCTGGCC GGCCGGTCAG 43 80 

CGTCCGTGAG CCGGGCGCCG GCCGCCCCGC GGGCCGGCGG CGGTGGACCC CCGGACCACC 444 0 

AGTTCCGGCA TGAAGACGAA TTCXiermcGC cmccicircirtra TTCCGCTCAT CTCCTCCAGC 4 5 00 

AGTGCGTCCA CGGCGACCTG CCCCATCGCC TTGACGGGCT GTCTGATGGT GGTCAGGGGA 4 560 

GGGTCGGTGA AGGCCATGAG CGGCGAGTCG TCGAAGCCGA CCACCGAGAT GTCACCGGGA 4 620 

ACCGTGAGAC CCCGCCGGCG CGCGGCCCGC ACGGCGCCGA GGGCCATCAT GTCGCTGGCG 468 0 

CACATGACGG CGGTGCAGCC CAGGTCGATC AGCGCGGACG CGGCGGCCTG GCCCCCCTCC 474 0 

AGGGAGAACA GCGAGTGCTG CACGAGCTCC TCGGACTCCC GCGCCGACAC TCCCAGGTGC 4 800 

TCCCGCACGC CGGCCCGGAA CCCCTCGATC TTCCGCTGCA CCGGCACGAA GCGGGCGGGC 4 860 

CCGACGGCGA GGCCGACGCG CTCGTGCCCC AGCTCCGCCA GGTGCGCCAC GGCCAGGCGC 4 920 

ATCGCGGCCC GGTCGTCCGG GGAGACGAAG GGTGCCTCGA TCCGGGGCGA GAACCCGTTC 4 98 0 

ACGAGGACGA AGGGCACCTG CCGCTCGTGC AGCCGGCCGT ACCGTCCGGT CTCGGCGGTG 504 0 

GTGTCCGCGT GCAGTCCGGA GACGAAGATG ATGCCGGACA CCCCGCGGTC CACGAGCATC 5100 

TCCGTGAGTT CGTCCTCCCT CGAGCC?GCCC GGGGTCTGCG TGGCGAGCAC GQGCGTGTAG 516 0 

CCCTGACGCG TGAGCGCCTG CCCCATCACC TGGGCCAGTG CGGGGAAGAA GGGGTTGTCC 5220 

AGTTCGGGGG TGACCAGTCC GACCAGCTCG GCGCGGCGCT GTCGCGCCGG CTGCTCGTAG 5280 

CCCAGCGCGT CCAGTGCGGT C AGCA C C GAG TCGCGGGTGC CGGTGGCCAC ACCGCGCGCA 5 34 0 

CCGTTCAGCA CCCGGCTGAC CGTGGCCTTG CTGACGCCCG CCCGGGCTGC GATGTCGGCG 54 0 0 

AGCCGCATGG TCATGGCAAC GCACTCTACC TGTCGGGGCG TCAGGGCGTG CCCACCGCGC 54 6 0 

GCGGAACCGG CGGACTGCGG GGCACGGCCC GTCCGCCGCC CACGGACCAC GCGCCCGAAA 5520 

CGATGGCTCA AAATGCTTCC AG C AAATTG C CGCAACGTCT TTCGGCGGCT TTTCGA7CCT 55B0 

GTTACGTTCC TGGCAACCCC GGCGCCGCGC AGAAGCGGTT GGCGTGAGGC GTCCAGACCT 564 0 

CCGCCCGATT CCGGGATCAC TCAGGGGAGT TCACAATGCG GCGTGGCATT GCGGCCACCG 5700 

CGCTGTTCGC GGCTGTGGCC ATGACGGCAT CGGCGTGTGG CGGGGGCGAC AACGGCGGAA S76 0 

GCGGTACCGA CGCGGGCGGC ACGGAGCTGT CGGGGACCGT CACCTTCTGG GACACGTCCA 562 0 

ACGAAGCCGA GAAGGCGACG TACCAGGCCC TCGCQGAGGG CTTCGAGAAG GAGCACCCGA 58 8 0 
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AGGTCGACGT CAAGTACGTC AACGTCCCGT TCGGCGAGGC GAACGCCAAG TTCAAGAACG 
CCGCGGGCGG CAACTCCGCT GCCCCGGACG TGATGCGTAC GGAGGTCGCC TGGGTCGCGG 
ACTTCGCCAG CATCGG CT AC CTCGCCCCGC TCGACGGCAC GCCCGCCCTC GACGACGGGT 
CGGACCACCT TCCCCAGGGC GGCAGCACCA GGTACGAGGG GAAGACCTAC GCGGTCCCGC 
AGGTGATCGA CACCCTGGCG CTCTTCTACA ACAAGGAACT GCTGACGAAG GCCGGTGTCG 
AGGTGCCGGG CTCCCTCGCC GAGCTGAAGA CGGCCGCCGC CGAGATCACC GAGAAGACCG 
GCGCGAGCGG CCTCTACTGC GGGGCGACGA CCCGTACTTG GTTCCTGCCC TACCTCTACG 
GGGAGGGCGG CGACCTGGTC GACGAGAAGA ACAAGACCGT CACGGTCGAC GACGAAGCCG 
GTGTGCGCGC CTACCGCGTC ATCAAGGACC TCGTGGACAG CAAGGCGGCC ATCACCGACG 
CGTCCGACGG CTGGAACAAC ATGCAGAACG CCTTCAAGTC GOCCAAOCTC GCCATGATGG 
TCAACGGCCC CTGGGCCATC GAGGArOTTA AfV; man A rzr CCG CTTCAAG GACGCCGGCA 
ACCTGGGGGT CGCCCCCGTC CCGGCCGGCA GTGCCGGACA GGGCTCTCCC CAGGGCGGGT 
GGAACCTCTC GGTGTACGCG GGCTCGAAGA ACCTCGACGC CTCCTACGCC TTCGTGAAGT 
ACATGAGCTC CGCCAAGGTG CAGCAGCAGA CCACCGAGAA GCTGAGCCTG CTGCCCACCC 
GCACGTCCGT CTACGAGGTC CCGTCCGTCG CGGACAACGA GATGGTGAAG TTCTTCAAGC 
CGGCCGTCGA CAAGGCCGTC GAACGGCCGT GGATCGCCGA GGGCAATGCC CTCTTCGAGC 
CGATCCGGCT GCAG 



5940 

6000 

6060 

£120 

6180 

6240 

6300 

6360 

6420 

6480 

6540 

6600 

6660 

6720 

6780 

6840 

6854 



(2) 



INFORMATION FOR SEQ ID NO.: 8: 



5 



(i) 



SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 240 amino acids 

(B) TYPE: amino acid 



(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



10 



MOLECULE TYPE: protein 



(ix) 



FEATURES: 



(A) NAME/KEY: acbA 

(B) LOCATION: 1..240 



15 



(xi) 



SEQUENCE DESCRIPTION: SEQ ID NO.: 8: 
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Val He Val Ala Glu His Leu Val Lys Glu Phe Arg Leu Ala Glu Arg 
1 5 io 15 

Glu Pro Gly Leu Leu Gly Ser Leu Ser Thr Leu Leu Thr Arg Arg Tyr 
20 25 30 

Arg Val Val Arg Ala Val Asp Asp Val Ser Phe Glu lie Pro Ala Gly 
35 40 45 

Thr Lys Thr Ala Tyr He Gly Ala Asn Gly Ala Gly Lys Ser Thr Thr 
SO 55 60 

He Lys Met Leu Thr Gly He Met Thr Pro Thr Ser Gly Arg Cys Thr 
65 70 75 80 

Val Ala Gly Leu Glu Pro Tyr Arg His Arg Gin Arg Asn Ala Arg Thr 
85 90 95 

lie Gly Val val Pne Gly Gin Arg Ser Gin Leu Trp Trp Asp Leu Ser 
100 105 110 

Val Pro Asp Ser Phe Arg He Leu Arg Arg He Tyr Asp He Pro Gly 
115 120 125 

Pro Val Tyr Arg Arg Asn Leu Ala Leu Phe Arg Asp Leu Leu Asp He 
130 135 140 

Asp Ala Leu Gly Ser Thr Pro Val Arg Gin Leu Ser Leu Gly Gin Arg 
1^5 150 155 160 

Met Arg Ala Glu He Ala Ala Ser Leu Leu His Asp Pro Ala Val Leu 
165 170 175 

Phe Trp Asp Glu Pro Thr lie Gly Leu Asp Met Val Leu Lys Asp Ala 
180 185 X90 

Val Arg Arg Leu Val Asn Arg Ala His Arg Glu Leu Gly Thr Thr Val 
195 200 205 

Val Leu Thr Ser His Asp He Ala Asp He Ala Ala He Cys Asp Ser 
210 215 220 

Ala Leu Val Val Asp Gin Gly Arg Val Val His Gin Gly Thr Leu Gin 
225 230 235 240 

(2) INFORMATION FOR SEQ ID NO.: 9: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 429 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



15 



(ix) FEATURES: 

(A) NAME/KEY: acbB 

(B) LOCATION: 1..429 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO.: 9: 

Met Thr Gly Leu Arg Gin Thr Gin His Leu Ala Arg Glu Ala Arg His 

15 10 15 

Leu Ala Pro Gly Ala Ser Glu Glu Ala Val His Gly Arg Arg Val Phe 
20 25 30 

Ala Glu Gly Arg Gly Pro Val Leu Thr Asp Leu Asp Gly Asn Gin Tyx 
35 40 45 

Leu Asp Phe Ala Ala Gly Thr Leu Thr Gin Ser Leu Gly His Gly His 
50 55 60 

Pro Glu Val Val Glu Ma Leu Thr Thr Gin Ala Arg Arg Leu Trp Asn 
65 70 75 80 

Val His Asp Ser Ala Thr Pro Asp Arg Ala Gly Leu Leu Glu Leu Leu 

85 90 95 
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Ala Arg Leu Lieu Pro Glu Gin Leu Asp Thr Tyr Ala Phe Phe Ser Thr 
100 105 110 

Gly Ala Glu Val Val Glu Ala Ala Leu Arg Val Val Gin Ala Thr Ala 
115 120 125 

Ala Pro Gly Arg Asn Arg lie Cys Ala Leu Arg His Gly Phe His Gly 
130 135 140 

Lye Thr Met Gly Ala Arg Met Leu Val His Trp Asp lie Gly His Gin 
145 150 155 160 

Ala Phe Ser Gly Asn Ser Val Leu Ala Thr Ala Pro Thr Gly Tyr Arg 
165 170 175 

Cys Pro Leu Gly Leu Glu Tyr Pro Ser Cys Asp Val Arg Cys Ala Thr 
180 185 190 

Leu val Arg Arg His lie Ala Glu Lys Pro Asn Val 5er Ala Leu Veil 
135 200 205 

Phe Glu Pro Val Leu Gly Ala Ala Gly Val lie Val Pro Pro Pro Gly 
210 215 220 

Tyr Trp Glu Arg lie Ala Gly Ala Cys Arg Asp Gly Gly val Leu Leu 
225 230 235 240 

Val Ala Asp Glu Val Leu Thr Gly Gly Gly Arg Thr Gly Ala Phe Leu 
245 250 255 

Ala Ser Glu Leu Phe Gly He Glu Pro Asp Leu Ala Met Leu Ser Lys 
260 265 270 

Gly TUr Ala Ser Gly Phe Pro Phe Ala Val Leu Ala Gly Arg Ala Glu 
275 280 285 

Ala Ala Gin Ala Gly Gly Gly His Pro Gly Ala Tyr Ala Ser Thr Tyr 
290 295 300 

Ala Ser Asn Pro Leu Gly He Ala Ala Ala Arg Ala Thr Leu Glu Val 
305 310 315 320 

Val Glu Arg Asp Arg Leu He Asp Arg Val Arg Val Leu Gly Glu Leu 
325 330 335 

He Gin Glu Arg Leu Arg Ala Leu Glu Ser Arg Phe Pro Gin Leu Gly 
340 34S 350 

Gin Val Arg Gly Leu Gly Leu Leu Trp Gly Leu Glu Phe Val Thr Asp 
355 360 365 

Ala Val Gly Arg Ala Pro Ala Pro Glu Thr Ala Arg Ala Val Tyr Thr 
370 375 380 

Thr Ala Leu Asp Leu Gly Leu Arg Thr Ser Leu Gly Gly His He Leu 
3B5 390 395 400 

Arg Leu Ala Pro Pro Phe Thr Leu Asp Glu Ala Leu Leu Asp Glu Gly 
405 410 415 

Leu Arg Leu Leu Glu Thr Ala Val Glu Arg Val He Ala 
420 425 



(2) 



INFORMATION FOR SEQ ID NO.: 10: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 355 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

5 

(ii) MOLECULE TYPE: protein 

(ix) FEATURES: 

(A) NAME/KEY: acbC 
10 (B) LOCATION: 1.. 355 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO.: 1 0: 
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Val Lys Ala Leu Val Leu Ala Gly Gly Thr Gly Ser Arg Leu Arg Pro 
1 5 10 15 

Phe Thr His Thr Ala Ala Lys Gin Leu Leu Pro lie Ala Asn Lys Pro 
20 25 30 

Val Leu Phe Tyr Ala Leu Glu Ser Leu Ala Ala Ala Gly Val Arg Glu 
35 40 45 

Ala Gly Val Val Val Gly Ala Tyr Gly Arg Glu lie Arg Glu Leu Thr 
50 55 60 

Gly Asp Gly Thr Ala Phe Gly Leu Arg He Thr Tyr Leu His Gin Pro 
6 5 70 75 80 

Arg Pro Leu Gly Leu Ala His Ala Val Arg He Ala Arg Gly Phe Leu 
85 90 95 

Gly Asp Asp Asp Phe Leu Leu Tyr Leu Gly Asp Asn Tyr Leu Pro Gin 
100 105 110 

Gly Val Thr Asp Phe Ala Arg Gin Ser Ala Ala Asp Pro Ala Ala Ala 
115 120 125 

Arg Leu Leu Leu Thr Pro Val Ala Asp Pro Ser Ala Phe Gly Val Ala 
130 135 140 

Glu Val Asp Ala Asp Gly Asn Val Leu Arg Leu Glu Glu Lys Pro Asp 
145 ISO 155 160 

Val Pro Arg Ser Ser Leu Ala Leu He Gly Val Tyr Ala Phe Ser Pro 
165 170 175 

Ala Val His Glu Ala Val Arg Ala He Thr Pro Ser Ala Arg Gly Glu 
180 185 190 

Leu Glu He Thr His Ala Val Gin Trp Met He Asp Arg Gly Leu Arg 
195 200 205 

Val Arg Ala Glu Thr Thr Thr Arg Pro Trp Arg Asp Thr Gly Ser Ala 
210 215 220 

Glu Asp Met Leu Glu Val Asn Arg His Val Leu Asp Gly Leu Glu Gly 
225 230 235 240 

Arg He Glu Gly Lys Val Asp Ala His Ser Thr Leu Val Gly Arg Val 
245 250 255 

Arg Val Ala Glu Gly Ala He Val Arg Gly Ser His Val Val Gly Pro 
260 265 270 

Val Val He Gly Ala Gly Ala Val Val Ser Asn Ser Ser Val Gly Pro 
275 280 285 

Tyr Thr Ser He Gly Glu Asp Cys Arg Val Glu Asp Ser Ala He Glu 
290 295 300 

Tyr Ser Val Leu Leu Arg Gly Ala Gin Val Glu Gly Ala Ser Arg He 
305 310 315 320 

Glu Ala Ser Leu He Gly Arg Gly Ala Val Val Gly Pro Ala Pro Arg 
325 330 335 

Leu Pro Gin Ala His Arg Leu Val He Gly Asp His Ser Lys Val Tyr 
340 345 35 0 

Leu Thr Pro 
355 



10 
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(2) INFORMATION FOR SEQ ID NO.: 1 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 325 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(ix) FEATURES: 

(A) NAME/KEY: acbD 

(B) LOCATION: 1..325 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO.: 11: 

Met Thr Thr Thr lie Leu Val Thr Gly Gly Ala Gly Phe He Arg Ser 
1 5 io 15 

Ala Tyr Val Arg Arg Leu Leu Ser Pro Gly Ala Pro Gly Gly Val Ala 
20 25 30 

Val Thr Val Leu Asp Lys Leu Thr Tyr Ala Gly Ser Leu Ala Arg Leu 
35 40 45 

His Ala Val Arg Asp His Pro Gly Leu Thr Phe Val Gin Gly Asp Val 
50 55 60 

Cys Asp Thr Ala Leu Val Asp Thr Leu Ala Ala Arg His Asp Asp He 
65 70 75 80 

Val His Phe Ala Ala Glu Ser His Val Asp Arg Ser lie Thr Asp Ser 
85 90 95 

Gly Ala Phe Thr Arg Thr Asn Val Leu Gly Thr Gin Val Leu Leu Asp 
100 105 no 

Ala Ala Leu Arg His Gly Val Arg Thr Phe Val His Val Ser Thr Asp 
115 120 125 

Glu Val Tyr Gly Ser Leu Pro His Gly Ala Ala Ala Glu Ser Asp Pro 
130 135 i 4 o 



WO 97/47748 



57 



PCT/EP97/02826 



Leu Leu Pro Thr Ser Pro Tyr Ala Ala Ser Lys Ala Ala Ser Asp Leu 
145 150 155 160 

Met Ala Leu Ala His His Arg Thr His Gly Leu Asp Val Arg Val Thr 
165 170 175 

Arg Cys Ser Asn Asn Phe Gly Pro His Gin His Pro Glu Lys Leu lie 
180 185 190 

Pro Arg Phe Leu Thr Ser Leu Leu Ser Gly Gly Thr Val Pro Leu Tyr 
155 200 205 

Gly Asp Gly Arg His Val Arg Asp Trp Leu His Val Asp Asp His Val 
210 215 220 

Arg Ala Val Glu Leu Val Arg Val Ser Gly Arg Pro Gly Glu lie Tyr 
225 230 235 240 

Asn lie Gly Gly Gly Thr Ser Leu Pro Asn Leu Glu Leu Thr His Arg 
245 250 255 

Leu Leu Ala Leu Cys Gly Ala Gly Pro Glu Arg lie Val His Val Glu 
260 265 270 

Asn Arg Lys Gly His Asp Arg Arg Tyr ALa Val Asp His Ser Lys lie 
275 280 2B5 

Thr Ala Glu Leu Gly Tyr Arg Pro Arg Thr Asp Phe Ala Thr Ala Leu 
290 295 300 

Ala Asp Thr Ala Lys Trp Tyr Glu Arg His Glu Asp Trp Trp Arg Pro 
305 310 315 320 

Leu Leu Ala Ala Thr 

325 

(2) INFORMATION FOR SEQ ID NO.: 12: 

(i) SEQUENCE CHARACTERISTICS: 
5 (A) LENGTH: 345 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

1 0 (ii) MOLECULE TYPE: protein 

(ix) FEATURES: 

(A) NAME/KEY: acbE 

(B) LOCATION: 1 .345 

15 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO.: 12: 
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Met Thr Met Arg Leu Ala Asp lie Ala Ala Arg Ala Gly Val Ser Lys 
1 5 10 15 

Ala Thr Val Ser Arg Val Leu Asn Gly Ala Arg Gly Val Ala Thr Gly 
20 25 30 

Thr Arg Asp Ser Val Leu Thr Ala Leu Asp Ala Leu Gly Tyr Glu Gin 
35 40 45 

Pro Ala Arg Gin Arg Arg Ala Glu Leu Val Gly Leu Val Thr Pro Glu 
50 55 60 

Leu Asp Asn Pro Phe Phe Pro Ala Leu Ala Gin Val Met Gly Gin Ala 
" 70 75 so 

Leu Thr Arg <?ln Gly Tyr Thr Pro Val Leu Ala Thr Gin Thr Pro Gly 
85 90 9S 

Gly Ser Thr Glu A-p Glu Leu Thr Glu Met Leu Val Asp Arg Gly Val 

100 105 xio 

Ser Gly li e n e Pne Val Ser Gly ^ Hifl JQa ^ Thr ^ 
115 120 12 s 

Thr Gly Arg Tyr Gly Arg Leu His Glu Arg Gin Val Pro Phe Val Leu 
130 "5 140 

Val Asn Gly Phe Ser Pro Arg He Glu Ala Pro Phe Val Ser Pro Asp 
145 150 155 ii5 

As P Arg Ala Ala Met Arg Leu Ala Val Ala His Leu Ala Glu Leu Gly 
165 170 175 



HIS Glu Arg Val Gly Leu Ala Val Gly Pro Ala Arg Phe Val Pro Val 
180 185 130 

Gin Arg Lys He Glu Gly Phe Arg Ala Gly Val Arg Glu His Leu Gly 
195 200 2 o5 

Val Ser Ala Arg Glu Ser Glu Glu Leu Val Gin His Ser Leu Phe Ser 
210 215 220 

Leu Glu Gly Gly Gin Ala Ala Ala Ser Ala Leu lie Asp Leu Gly Cys 
225 230 235 240 

Thr Ala Val Met Cys Ala Ser Asp Met Met Ala Leu Gly Ala Val Arg 
2 *5 250 255 

Ala Ala Arg Arg Arg Gly Leu Thr Val Pro Gly Asp He Ser Val Val 
260 265 270 

Gly Phe Asp Asp Ser Pro Leu Met Ala Phe Thr Asp Pro Pro Leu Thr 
275 280 285 

Thr lie Arg Gin Pro Val Lys Ala Met Gly Gin Val Ala Val Asp Ala 
290 295 3 0 o 

Leu Leu Glu Glu Met Ser Gly Thr Pro Pro Pro Arg Thr Glu Phe Val 

310 31S 3 20 

Phe Met Pro Glu Leu Val Val Arg Gly Ser Thr Ala Ala Gly Pro Arg 
325 330 7 335 

Gly Gly Arg Arg Pro Ala His Gly Arq 
340 345 
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(2) INFORMATION FOR SEQ ID NO.: 13: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 393 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(ix) FEATURES: 

(A) NAME/KEY: acbF 

(B) LOCATION: 1..393 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO.: 13: 
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Met Arg Arg Gly lie Ala Ala Thr Ala Leu Phe Ala Ala Val Ala Met 
15 10 15 

Thr Ala Ser Ala Cys Gly Gly Gly Asp Asn Gly Gly Ser Gly Thr Asp 
20 25 30 

Ala Gly Gly Thr Glu Leu Ser Gly Thr Val Thr Phe Trp Asp Thr Ser 
35 40 45 

Asn Glu Ala Glu Lys Ala Thr Tyr Gin Ala Leu Ala Glu Gly Phe Glu 
50 55 60 

Lys Glu His Pro Lys Val Asp Val Lys Tyr Val Asn Val Pro Phe Gly 
65 70 75 80 

Glu Ala Asn Ala Lys Phe Lys Asn Ala Ala Gly Gly Asn Ser Gly Ala 
85 90 95 

Pro Asp Val Met Arg Thr Glu Val Ala Trp Val Ala Asp Phe Ala Ser 
100 105 110 

lie Gly Tyr Leu Ala Pro Leu Asp Gly Thr Pro Ala Leu Asp Asp Gly 
115 120 125 

Ser Asp His Leu Pro Gin Gly Gly Ser Thr Arg Tyr Glu Gly Lys Thr 
130 135 !4o 

Tyr Ala Val Pro Gin Val lie Asp Thr Leu Ala Leu Phe Tyr Asn Lys 
145 150 155 160 

Glu Leu Leu Thr Lys Ala Gly Val Glu Val Pro Gly Ser Leu Ala Glu 
165 170 175 

Leu Lys Thr Ala Ala Ala Glu lie Thr Glu Lys Thr Gly Ala Ser Gly 
180 18 5 190 

Leu Tyr Cys Gly Ala Thr Thr Arg Thr Trp Phe Leu Pro Tyr Leu Tyr 
195 200 205 

Gly Glu Gly Gly Asp Leu Val Asp Glu Lys Asn Lys Thr Val Thr Val 
210 215 220 

Asp Asp Glu Ala Gly Val Arg Ala Tyr Arg Val lie Lys Asp Leu Val 
225 230 235 240 

Asp Ser Lys Ala Ala lie Thr Asp Ala Ser Asp Gly Trp Asn Asn Met 
245 250 255 

Gin Asn Ala Phe Lys Ser Gly Lys Val Ala Met Met Val Asn Gly Pro 
260 265 270 

Trp Ala He Glu Asp Val Lys Ala Gly Ala Arg Phe Lys Asp Ala Gly 
275 280 285 

Asn Leu Gly Val Ala Pro Val Pro Ala Gly Ser Ala Gly Gin Gly Ser 
290 295 300 

Pro Gin Gly Gly Trp Asn Leu Ser Val Tyr Ala Gly Ser Lys Asn Leu 
305 310 315 320 
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Asp Ala Ser Tyr 



Gin Gin Thr Thx 
340 

Tyr Glu Val Pro 
35S 

Pro Ala Val Asp 
370 

Ala Leu Phe Glu 
365 



Ala Phe Val Lye 
325 

Glu Lys Leu Ser 



ser val Ala Asp 
360 

Lys Ala Val Glu 
375 

Pro lie Arg Leu 
390 



Tyr Met Ser Ser 
330 

Leu Leu Pro Thr 
345 

Asn Glu Met Val 



Arg Pro Trp lie 
380 

Gin 



Ala Lys Val Gin 
335 

Arg Thx Ser Val 
350 



Lys Phe Phe Lys 
365 

Ala Glu Gly Asn 



